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Abstract 

The purpose of this letter is to show that a rotation of Hartree-Fock canonical 
orbitals which minimizes the lowest eigenvalues of a configuration interaction calcu- 
lation limited to the mono-excited configurations from a given orbital allows one to 
construct a better starting guess for the geminal mean field configuration interaction 
method. 
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1 Introduction 



In the electronic mean field configuration interaction (EMFCI) method [1], group func- 
tions corresponding to a given group structure are optimised, with or without orthogo- 
nality constraint, to represent the lowest states of a given symmetry. The expression of 
an EMFCI wavefunction is not necessarily unique. For example, in the limit case where 
the group structure consists in an antisymmetrised product of one-electron functions, 
i.e. the Hartree-Fock (HF) case, it is well-known that any rotation (respectively unitary 
transformation) does not change (respectively, changes only the phase of) the HF wave 
function. Furthermore, if a restricted HF wave function, is considered as a Grassmann's 
exterior product of geminals: 

"^HF ^ 91^92 /\--- ^9n (1) 

with Qi — ipi A ipj^, the -^j's being for example the canonical HF orbitals, then an even 
larger set of transformations leave the RHF wave function invariant while preserving a 
geminal product structure. As an example, it is well known that this wave function can 

be written as an "antisymmctrizcd geminal product" (AGP) of extreme type [2,3], 

^HF ^ 9 ^9 ^9, (2) 
where g = {n\)-n{g^ + 92^ \- 9n)- 

The non-orthogonal geminal mean field configuration interaction (NO-GMFCI) method 
presented in Ref.[l], which is the simplest case of EMFCI that preserves spin symmetry 
for a system containing an even number of electrons, if pushed to convergence (non- 
orthogonal geminal self-consistent field (NO-GSCF) method) with no basis set trun- 
cation, gives a ground state independent of the particular representation of the guess 
wave function. This is not the case for the excited states. This is not the case either, 
for the ground state, if the basis set of geminals used to represent the geminal mean 
field Hamiltonian is truncated. Such a truncation can be compulsory to deal with large 
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systems. 



Exploratory calculations have shown that the AGP form of the HF wave function, Eq.(2), 
is not a good guess for a molecular system, as in general the convergence of a NOG-SCF 
calculation is slower than with the same HF wave function in canonical form, Eq.(l). It 
will not be investigated in the present work. Similarly, a random rotation of the canonical 
orbitals gives a HF wave function of form, Eq.(l), whose geminals have not proved useful 
as a guess for subsequent NO-GSCF calculations, in general. Other attempts to use the 
eigen geminals of the spin-adapted reduced 2-electron Hamiltonian [4] or some extremal 
electron pairs of Kutzelnigg et al. [5,6] have proved unfruitful so far. Our aim in this 
letter is to demonstrate that the orbitals obtained by a rotation of the canonical HF 
orbitals which minimizes the energy of some specific, mono-excited states, can provide a 
better set of guess geminals than those constructed over canonical orbitals. 

The letter is organized as follows: In the next section, we explain how we define "mono- 
excitation optimized HF orbitals" (MOHFOs). Then, in Section 3, we compare MOHFOs 
based geminals with canonical orbital based geminals on simple examples. Finally, we 
present the conclusions we have drawn from these examples. 



2 Removing HF orbitals cirbitrariness 

An EMFCI calculation step consists essentially in a limited configuration interaction 
(CI) calculation, in the space spanned by the Grassmann product of a set of "active" , 
contracted functions, $5) • • • > with a fixed Grassmann product of spectator, con- 
tracted function, \E'2 A ■ ■ ■ A That is to say, the solution of an EMFCI step is obtained 
by the diagonalisation of a Hamiltonian matrix, H, whose elements are defined, in Dirac 
notation, by, Vi, j e {0, • • • , M}, 

Hij = ($i A *° A • • • A A ^° A • • • A *°), (3) 
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where H also denotes the Hamiltonian operator in n-electron space. In a general EMFCI 
calculation, the numbers of electrons, nj, i e {1, ■ ■ ■ ,r}, of the r contracted functions 
in each Grassmann product have only to satisfy the constraint that their sum equal the 
total number of electrons in the system. 

An unrestricted HF (UHF) calculation for an n-electron system can be seen as a partic- 
ular case of EMFCI where at each step: r — n, Ui — 1, Vi e {1, . . . , n\, the spectator, 
contracted functions arc n — 1 occupied spin-orbitals, \E'^ = ipi^ Vi G {2, . . . , n}, and the 
basis set of active, contracted functions span the orthogonal complement, $5 = V-'i- = 
. . . , = -0^. Here, m is the number of spin-orbitals, and M = m — n. By iterating 
such a step with the active spin-orbital corresponding to the approximate ground state 
becoming spectator and a new, occupied spin-orbital becoming active, one can converge 
towards the UHF solution. 

The stationarity of the ground state energy in this algorithm is clearly equivalent to the 
Brillouin condition [7]. This algorithm has the advantage that the approximate excited 

M 

states obtained by diagonahsation of the Hamiltonian matrix, $i A • • • A = 



{a\ipi + a^ipn+i + ■ — I- oflt^fn) A ^/'2 A ■ ■ ■ A ipn, A; G {1, . . . , M}, have a physical relevance 
and define a set of virtual spin-orbitals, (a^V^i -|- a2V'„+i -|- • — h a'i^iprn), k e {!,..., M}, 
such that in addition to Brillouin's theorem, the Hamiltonian matrix elements between 
the mono-excited configurations from ■^i towards these virtuals are zero. 

In the case of a converged, RHF, 2n-electron wave function, a CI calculation in a basis 



set of the form V^iAV^^ A- • • A^^.i A^„_i A r^'''''^''+^^^"+^'''^" , . . . , V^i A V^i A • • • A V^„_i A 



set by Brillouin's theorem and there is no need to add it), provides eigenf unctions. 



V2 




m—n 



^ aj V'l A V'l A • • • A ijjn-i A V'„_i A 



( 



Ipn A i>n+j + Ipn+j A -0, 

^/2 



n 
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m—n \ \ 



V2 



(4) 



and defines a set of virtual spin-orbitals, E i^n+j, such that in addition to Brillouin's 
theorem, the Hamiltonian matrix elements between the mono-excited configurations from 
ipn towards these virtuals are zero. 

However, the arbitrariness recalled in introduction remains for the choice of the occupied 
orbitals of a given symmetry. A natural way to remove this arbitrariness following a vari- 
ational approach is to minimize the second lowest eigenvalue of the CI calculation in the 
basis set R{^i)AR{^^)A---AR{^n-i)AR{^,,_^)A(^^^''^^ , . . . A 

A • • • A Riipn-i) A R{^n-i) A j^ ^(^")^^'n4|n.AR(i^j ^ ^.^j^ rcspcct to the rotation of 
the occupied orbitals R. Note that, in fact, this rotation can be restricted to the orbitals 
carrying a given irreducible representation in case of a non trivial spatial symmetry. 
Then, one can minimize the next lowest eigenvalue, and so on, until the occupied or- 
bitals rotation freedom is exhausted. 

This approach is equivalent to the extended Hartree-Fock (EHF) method of Morokuma 
[8], except that in our case, each CI calculation gives also a complete set of virtuals in 
one-to-one correspondance with the n-electron eigenfunctions of the CI matrix. However, 
one can first use Morokuma's method and then perform a CI with the optimized orbitals 
to obtain also the same final set of virtuals. This is, in fact, a practical way to proceed. 
This way, not only one removes the arbitrariness in the HP orbitals and obtains what 
we call "mono-excitation optimized HF orbitals" (MOHFOs), but also one can exploit 
the one-to-one correspondance between virtual orbitals and eigenvalues of a bona fide n- 
electron CI matrix to select a reduced set of the former. In the next section, we compare 
canonical HF orbitals (CHFOs) with MOHFOs to construct guess geminals for the NO- 
GMFCI method. 
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3 Compcirison of canonical versus mono-excitation optimised orbitals 

Following the EMFCI line of thought and trying to improve the description of excited 
states, it is natural to use MOHFOs rather than CHFOs. Furthermore, it might be 

necessary to truncate the HF orbital basis set to deal with large systems and the energies 
of the mono-excited configurations associated with the virtuals can be used to discard 
those that are above a given threshold. However, we must ascertain the superiority of 
MOHFOs on simple examples. 

MOHFOs and CHFOs differs both by their occupied and virtual orbitals. The effect of 
the optimization of both orbital types can be analysed separately By using a minimal 
basis set, the improvement due to virtual orbital optimisation can be quenched, because 
with such a basis set, there is usually only one virtual orbital with the appropriate 
symmetry to build the relevant, mono-excited configurations. In contrast, the number of 
variational degrees of freedom available for the optimisation of the occupied orbitals is 
not affected by the size of the basis set. It only depends upon the number of occupied 
orbitals of each symmetry. So, we will first investigate the effect of optimising occupied 
orbitals on several systems with a ST0-3G basis set. This has the additional advantage 
that full CI (FCI) calculations can be performed for comparison. 

The simpler example one can think of is arguably LiH. But in this system, the difference 
between CHFOs and MOHFOs is extremely small, because the only variational degree of 
freedom is the rotation between the core orbital and the valence one, and as one expect 
these two orbitals do not mix. Many other larger systems exhibit the same feature because 
of symmetry properties of the occupied orbitals. For example, the HF ground state 
energy of Li2 at the intcrnuclear distance of 2 bohr is -14.395113 a.u. and the first ^S"*" 
mono-excited state energy is -14.094230 a.u. by using the HF canonical orbitals. By 
optimizing the rotation of the core orbital and the unique valence orbital allowed to mix 
by symmetry the energy only decreases down to -14.094435 a.u. . Results for this type 
of systems are not reported further. 
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The simpler system where at least two valence orbitals can possibly mix to produce 
MOHFOs significantly different from CHFOs is BH. It is iso-electronic to Li2 but has 
only Coov symmetry instead of Dooh- Results for the first ^11 state of this system are 
reported in Table 1 alongside with the low-lying excited states of water. The first 
state of BH correspond to the excitation of the highest a orbital towards the lowest tt 
orbitals. As there is only one set of virtual tt orbitals in the ST0-3G basis set, MOHFO 
and CHFO sets differ only by their occupied orbitals. After minimizing the lowest ^11 
excited state energy, it remains possible to minimize the next excited state energy with 
respect to a rotation of two occupied a orbitals without changing the already minimized 
ground state and first excited state energies. However, this proves to have a negligeable 
lowering effect on the energy value, because the remaining two a orbitals do not mix, 
having very different natures, one being a core orbital and the other a valence orbital. 
The optimisation of the occupied orbitals of BH reduces the excited state energy by 
about 5 mhartree which is more than an order of magnitude larger than in Li2. This 
energy lowering is preserved after a NO-GSCF calculation, although the MOHFO and 
CHFO ground states are no longer identical. A linear dependency threshold of 10~^ was 
used in the NO-GSCF, that is to say, the geminal functions whose norm after projection 
in a Gram-Schmidt orthogonalisation process arc less than 10^^, were eliminated from 
the geminal basis set at each GMFCl step. Such a low tolerance prevented the gemi- 
nal excited states from loosing their identity as approximate excited states of the whole 
system during the GMFCI iterations. The LCI (CI hmited to mono-excitations from a 
given orbital) vertical excitation energy is worse with MOHFOs than with CHFOs com- 
pared with FCl, however, for the more accurate NO-GSCF calculation it is significantly 
improved with MOHFOs with respect to CHFOs. 

In the ST0-3G basis set, H2O has only one ai virtual orbital, so, in the optimization 
of MOHFOs for the first ^Ai excited state, (which corresponds to the excitation of the 
highest oi occupied orbital to the oi virtual), there cannot be any effect from a symmetry 
preserving rotation of virtual orbitals. Therefore again, the difference between MOHFOs 
and CHFOs depends only upon the rotation of the occupied orbitals. There are three 
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occupied oi molecular orbitals (including one core orbital) that can be rotated by the 
EHF process. So the variational freedom is equivalent to that of the BH system already 
studied. And similarly to BH, after minimizing the lowest ■'^Ai excited state energy, it 
remains possible to minimize the next excited state energy with respect to a rotation 
of two occupied ai orbitals without changing the ground state and first excited state 
energies. However, this proves to have a negligeable lowering effect on the energy value. 

Table 1 shows that the first ^Ai excited state is lowered by about 5 millihartrees with 
MOHFOs. So, a given variational freedom seems to produce an energy lowering always 
of the same order of magnitude for the optimised level. In contrast, the lowest ^Bi state 
energy, which correspond to the excitation of the optimised ai occupied orbital towards 
the bi virtual orbital, increases by about 7.5 millihartrees. Therefore, the energy differ- 
ence between these two states is significantly improved with MOHFOs when compared 
to the FCI result. The ^B2 and ^A2 states do not correspond to an excitation from the 
highest occupied ai orbital, and are essentially unaffected by the optimisation of the 
latter. 

Now let us turn to the effect of optimising virtual orbitals. This can be analysed on a 
fixed system, (so that the variational freedom for the optimisation of the occupied is 
fixed), by varying the basis set. The system must be large enough to justify a trun- 
cation of the virtual orbital set before the NO-GSCF calculation, since we want to 
compare the effect of such a truncation with both orbital guesses. For this study, we 
have chosen the classical example of ozone. The low-lying singlet and triplet excited 
states of ozone have been the subject of numerous theoretical and experimental inves- 
tigations [9,10,11,12,13,14,15,16,17,18,19] to quote a few, because of the importance of 
this molecule in the upper athmosphere. 

In this study, we will take as references the two most recent theoretical studies which 
are the only ones predicting the correct order of the lowest singlet energy levels of the 
Chappuis band [16,17]. We focus attention on the ^Ai ground state and the two lowest 
singlet excited states of ozone. According to Table 5 of [16], the next excited state in 
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increasing order of energy is dominated by a diexcited configuration whicfi corresponds 
to tfie excitation of HF orbitals pertaining to two different geminal groups. Therefore, 
it cannot be described properly within the GMFCI approach and would require a more 
general EMFCI calculation. 

The first singlet excited state carries the Bi representation of the symmetry group 
and is dominated by the excitation of the highest oi occupied orbital to the lowest 61 
virtual orbital. The symmetry preserving EHF process mimimizes the energy of this state 
by rotating the 6 occupied ai orbitals and the virtual bi orbitals. The HF wave function 
has only one occupied orbital carrying the a2 and the bi representations. Therefore 
no symmetry preserving rotation of these occupied orbitals can affect an excited state 
energy. In contrast, there are four 62 occupied HF orbitals to consider. The second singlet 
excited state carries the A2 representation and is dominated by the excitation of the 
highest 62 occupied orbital to the lowest bi virtual orbital. So, we use the minimization 
of its energy with respect to a rotation of the four occupied 62 HF orbitals to remove, 
partly, the arbitrariness of the latter. The virtuals obtained from the first excited state 
minimization are not reoptimised. (In fact, we have shown that reoptimising them was 
not significantly improving the A2). 

The results are displayed in Table 2 for the triple zeta valence basis set plus polarization 
and Rydberg functions (pVTZ+Ryd) used in [16], and for the augmented correlation 
consistent triple zeta valence basis set plus polarization (aug-cc-pvtz) used in [17]. The 
effect of the EHF optimization is important as seen from the comparison of the two 
LCI columns. In the "stepO" columns we have performed GMFCI calculations taking 
successively as active the electron pairs occupying the highest energy geminals made 
of, respectively, ai and 62 orbitals, to obtain the vertical excitation energies from the 
ground state to, respectively, the lowest ^Bi and ^A2 states. These calculations are "step 
0" GMFCI because the spectator geminals are the guess geminals constructed either 
from MOHFOs or CHFOs. 

To appreciate the advantage of MOHFOs over CHFOs, the GMFCI calculations have 
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been performed after truncating the 81 orbital basis functions of the pVTZ+Ryd basis as 
well as the 165 orbital basis functions of aug-cc-pvtz to only 18 orbitals. For MOHFOs, 
these 18 orbitals correspond to the occupied orbitals and the virtuals whose associated 
the lowest CI energies. For CHFOs, they are the 18 orbitals corresponding to the dom- 
inant component of the selected MOHFOs. Larger sets of virtuals would lead to more 
accurate results but would reduce the difference between MOHFOs and CHFOs (in the 
limiting case of the absence of truncation, virtual orbital optimisation would have no 
effect on calculations which treat equivalently all the virtuals) . 

All the GMFCI energy differences increase slightly with respect to the LCI calculations 
because, for symmetry reasons, in the small truncated orbital basis set, there are many 
more di-excited Ai geminals interacting with the ground state geminal than Bi or A2 
geminals interacting with the Bi or A2 excited states considered. So, the energy low- 
ering is more important for the ground state than for the excited states. However, the 
improvement with MOHFOs of the excitation energies, compared to the MCSCF/SDCI 
values of [16] observed at the LCI level, remains essentially preserved in step GMFCI 
calculations. Our results are somewhat further away from the reference results with the 
larger basis set. However, similar observations can be done for this basis set, in particular 
the difference between MOHFOs and CHFOs is of the same order of magnitude despite 
the more drastic truncation of virtuals. So for virtuals, more variational freedom does not 
imply necessarily better MOHFOs with respect to CHFOs. Note that the right ordering 
of the energy levels is obtained in all our calculations. 

Conclusion 

The present work shows that, the freedom one has to rotate CHFOs, already exploited 
by Morokuma and Iwata in the 1970's, can be taken advantage of to obtain improved 
guess geminals for the GMFCI method. The orbitals obtained by minimizing the energy 
of some specific mono-excited states allow one to truncate the initial orbital basis set, 
used to build the geminal basis functions, and/or to truncate the geminal basis set itself 
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(by means of the linear dependency cutoff), and then, to perform GMFCI calculations 
of a better accuracy on the targeted spectral transitions than the same GMFCI with the 
initial CHFOs. 

Of course, this conclusion on the numerical usefulness of MOHFOs for the GMFCI 
method should be ascertained by investigating more systems. However, from a purely 
theoretical standpoint, MOHFOs are clearly more appealing than CHFOs, since the 
arbitrariness of the latter can be removed by a simple application of the variational 
principle which is natural within the EMFCI philosophy, where a SCF process is seen as 
succession of MFCI steps. 
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Table 1: Vertical excitation energies from ground state in Hartree, (ST0-3G calculations; 
BH equilibrium bond length of 1.2324 angstrom from Ref. [20]; H20 equilibrium bond 
length of 0.958 angstrom, bond angle of 104.4776 degrees from Ref. [21]. 
Limited configuration interaction (LCI), refers to a CI limited to the mono-excitations 
from the highest ai orbital, therefore, the ground state value is the HF energy for both 
sets of orbitals. GMFCI refers to a crude NO-GSCF calculation with linear dependency 
threshold set to 10~^. The states whose energy is italicized are the ones used to opti- 
mise the occupied orbitals. For water, the energy gap between the ^Ai and ^Bi excited 
states, affected by the orbital optimisation, increases with MOHFOs and compares more 
favorably with the FCI reference. (See main text for the acronyms.) 
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Table 2: Lowest singlet energy levels (Chappuis band) with respect to the ground state 
for ozone, in eV. The pVTZ+Ryd basis set and corresponding equilibrium geometry are 
taken from Ref. [16]: bond length of 1.2924 angstrom, bond angle of 116.53 degrees, 
3s2p2d Rydbcrg functions on the central oxygen nucleus only with exponents 0.021, 
0.008, 0.0025, 0.017, 0.09, 0.015, 0.008 au, respectively The aug-cc-pvtz basis set and 
corresponding equilibrium geometry are taken from Ref. [17]: bond length of 1.2822 
angstrom, bond angle of 116.80 degrees. Step refers to the initialisation step of a 
GSCF calculation where each geminal CI is performed in the mean field of the guess 
geminals for the spectator groups. The advantage of the MOHFOs, over the CHFOs is 
preserved in the step GMFCI calculations. (See main text for the acronyms.) 
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